ERRATA AND ADDITIONS

"Minimizing  Functionals  on  Normed-Linear  Spaces" 

by 

A.  A.  Goldstein 

Page 3. In (b) of the theorem replace $L_p[0,1]$ by a uniformly convex
Banach space.

Page  4.  Line  7  from  bottom.  Assume  E  is  uniformly  convex.  By  [6]  p.  113. 

Page  8.  Line  6.  for  p  >  1. 

Page 10. Insert after sentence of line 5: and f is uniformly F-differentiable
on S if p > 2. The inequality
$|a|^{2r} + |b|^{2r} - 2|a|^r|b|^r[a,b]/(|a||b|) \le |a|^{2r-2}(r^2+1)|a-b|^2$, where $|a| > |b|$,
and a direct computation show that the F-derivative f' exists and
is Lipschitz continuous for all p > 1.

Page 11. Line 12. Delete paragraph beginning with "By the theorem of I..."
Replace this with the following: Moreover, the space is uniformly
convex. This follows by a theorem of Smulian [12], which states that
if the norm in a B-space is uniformly F-differentiable on the unit
sphere, then the conjugate space is uniformly convex.

Page 13. Replace ref [6] by M. Day, Normed Linear Spaces, Academic Press,
N. Y., 1962.

Add: 

[12] V. Smulian. Sur la dérivabilité de la norme dans l'espace de Banach,
C. R. (Doklady) Acad. Sci. U.R.S.S., Vol. 27 (1940), pp. 643-648.
ABSTRACT


This paper extends results of Goldstein [1], [2] and of Vainberg [3]
concerning steepest descent and related topics. An
example is given, taken from a simple rendezvous problem in control
theory. The problem is one of minimizing a norm on an affine
subspace. The problem here is solved in the primal. A solution in
the dual is given by Neustadt [4].


I.  GENERATION  OF  MINIMIZING  SEQUENCES 


Let E be a normed linear space (n. l. space), $x_0$ an arbitrary
point of E and f a functional defined on E. Let S denote the level set
$\{x \in E : f(x) \le f(x_0)\}$ defined at an arbitrary fixed $x_0 \in E$. We
denote by f'(x) the Frechet or F-derivative of f at x. We call
f uniformly F-differentiable on S if f is F-differentiable
on S and if $\delta(\epsilon)$ in the definition of the F-derivative is constant
on S. If $g \in E^*$ the value of g at x will be denoted by [g,x], and if
$h \in E^{**}$ the value of h at $g \in E^*$, by [h,g]. Recall that if E
and F are n. l. spaces and A is a bounded linear operator from E to
F, in short $A \in B(E,F)$, and if A is onto then $A^{-1}$ exists and
belongs to B(F,E) if and only if for some m > 0 and all $x \in E$,
$\|Ax\| \ge m\|x\|$; and that $m\|x\| \le \|Ax\| \le M\|x\|$ for all x in
E implies that $M^{-1}\|y\| \le \|A^{-1}y\| \le m^{-1}\|y\|$ for all y in F.

We observe that if E is a reflexive Banach space, $A \in B(E,E^*)$ and
$[Ax,x] \ge m\|x\|^2$ for all $x \in E$, then A is onto and thus has an
inverse. For, on the contrary supposition, take $f^0 \notin M$ = range A.
Choose g in $E^{**}$ such that $g(f^0) = \mathrm{dist}(f^0,M) > 0$, $\|g\| = 1$ and
g(f) = 0 for all f in M. Take $\hat g$ in E so that $[g,f] = [f,\hat g]$
for all f in $E^*$. Then $0 = [g,Ax] = [Ax,\hat g]$ for all x in E.
Thus $[A\hat g,\hat g] = 0$ while $\|\hat g\| = \|g\| = 1$. Q.E.D.

Let $\phi$ denote a bounded map from S to E satisfying the two
conditions $[f'(x),\phi(x)] \ge 0$, and given $\epsilon > 0$ there exists $\delta > 0$
such that $[f'(x),\phi(x)] < \delta$ implies $\|f'(x)\| < \epsilon$. Some examples
of such mappings are the following:

(1) Let $A \in B(E^*,E)$ such that $[y,Ay] \ge \sigma\|y\|^2$ for all
$y \in E^*$ and some $\sigma > 0$. Let $\phi(x) = Af'(x)$ and choose $\delta = \epsilon^2\sigma$.
Then $\|f'(x)\| < \epsilon$.

As a possible candidate for the operator A, suppose f is twice
F-differentiable on E. Assume that for some $\mu > 0$, and some x
in S the operator f"(x) in $B(E,E^*)$ is onto and is "bounded below",
that is, the bilinear functional satisfies $[f''(x)z,z] \ge \mu\|z\|^2$ for
all z in E. Then $\|f''(x)z\| \ge \mu\|z\|$ showing that f"(x) has an
inverse $[f''(x)]^{-1} = A \in B(E^*,E)$. Since A has a bounded inverse, there
exists a number $\sigma > 0$ such that $\|Ay\| \ge \sigma\|y\|$ for all $y \in E^*$.
Set z = Ay. Then $[f''(x)z,z] = [y,Ay] \ge \mu\sigma^2\|y\|^2$ showing the
candidacy of A.

(2) Suppose E is a reflexive Banach space. By the weak compactness
of the unit sphere in E it follows that for some $z_0$, $\|z_0\| = 1$,
$[f'(x),z_0] = \|f'(x)\|$. Set $\phi(x) = z_0\|f'(x)\|$. Because
$[f'(x),\phi(x)] = \|f'(x)\|^2$, $\phi(x)$ is the analogue of the gradient in
Hilbert space. When E is an $L_p$ space the point $z_0$ is obtained by
considerations of equality in Holder's inequality.
(3) Since $\|f'(x)\| = \sup\{[f'(x),z] : \|z\| = 1\}$, if $0 < \alpha < 1$
a point $z_0$ exists such that $[f'(x),z_0] \ge \alpha\|f'(x)\|$. If for fixed $\alpha$ and
all $x \in S$ we can find such $z_0$, we may take $\phi(x) = z_0\|f'(x)\|$.
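As a hedged illustration of example (2), the extremal point $z_0$ can be written down explicitly when E is replaced by a finite-dimensional stand-in, the space of n-vectors with the p-norm. This is only a sketch under that assumption; the function name `holder_extremal` is hypothetical and not taken from the paper. It uses the equality case of Holder's inequality: for a functional g with conjugate exponent q, the point with components sign(g_i)|g_i|^(q-1), scaled by ||g||_q^(q-1), has p-norm 1 and attains [g, z0] = ||g||_q.

```python
import math

def holder_extremal(g, p):
    """Return z0 with ||z0||_p = 1 and sum(g_i * z0_i) = ||g||_q, where 1/p + 1/q = 1.

    Finite-dimensional stand-in for the L_p case; assumes p > 1.
    If g is identically zero, the zero vector is returned (no unit extremal is needed).
    """
    q = p / (p - 1.0)
    norm_q = sum(abs(gi) ** q for gi in g) ** (1.0 / q)
    if norm_q == 0.0:
        return [0.0] * len(g)
    scale = norm_q ** (q - 1.0)
    # Equality in Holder's inequality: z0_i is proportional to sign(g_i) * |g_i|^(q-1).
    return [math.copysign(abs(gi) ** (q - 1.0), gi) / scale for gi in g]

def phi_from_derivative(g, p):
    """The map phi(x) = z0 * ||f'(x)|| of example (2), given g = f'(x) as a vector."""
    q = p / (p - 1.0)
    norm_q = sum(abs(gi) ** q for gi in g) ** (1.0 / q)
    return [norm_q * zi for zi in holder_extremal(g, p)]
```

With this choice the pairing of g with `phi_from_derivative(g, p)` equals the square of the q-norm of g, which is the finite-dimensional form of $[f'(x),\phi(x)] = \|f'(x)\|^2$.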

In what follows let $\Delta(x,\rho) = f(x) - f(x - \rho\phi(x))$ and
$g(x,\rho) = \Delta(x,\rho)/\rho[f'(x),\phi(x)]$. Assume E is a normed-linear space and
S is the level set of f at $x_0$ in E. In what follows, assume $0 < \sigma < 1/2$.
Theorem. Assume that on S f is uniformly F-differentiable or that
the F-derivative f' exists and is uniformly continuous. Set $x_{k+1} = x_k$
when $[f'(x_k),\phi(x_k)] = 0$; otherwise choose* $\rho_k$ so that $\sigma \le g(x_k,\rho_k) \le 1 - \sigma$
when $g(x_k,1) < \sigma$, or $\rho_k = 1$ when $g(x_k,1) \ge \sigma$, and set $x_{k+1} = x_k - \rho_k\phi(x_k)$.

(a) If S is bounded or f is bounded below then $\{f'(x_k)\}$ converges
to 0 while $\{f(x_k)\}$ converges downward to a limit, L. If S is compact,
then every cluster point of $\{x_k\}$ is a zero of f'. In addition, if $\phi(x_k) \to 0$
and f' has finitely many zeros, $\{x_k\}$ converges.

(b) If S is convex and bounded and f is convex,
$L = \inf\{f(x) : x \in S\} = \theta$. If, in addition, E is a reflexive Banach space,
then every weak cluster point of $\{x_k\}$ minimizes f on E. If $E = L_p[0,1]$,
then $\{x_k\}$ converges to a unique minimizer of f.

(c) Assume that the Gateaux derivative f" exists on S and satisfies
$\mu\|z\|^2 \le [f''(x)z,z] \le M\|z\|^2$ for all $x \in S$, $z \in E$ and some $\mu > 0$.
Assume S is convex and E is complete. Then $\{x_k\}$ converges to a unique
minimizer of f on E.

* If the Gateaux differential f" satisfies $f''(x,h,h) \le \|h\|^2/\rho_0$ for all
h in E, x in S and some $\rho_0 > 0$ we can choose $\rho_k$ to satisfy
$\delta \le \rho_k \le 2\rho_0 - \delta$ with $0 < \delta \le \rho_0$. The method of steepest descent could
also be employed. See [9].
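The step-size rule of the theorem can be sketched in a finite-dimensional Hilbert-space setting, where $\phi(x)$ is taken to be the gradient. This is a minimal sketch under that assumption, not the paper's implementation: the function name `goldstein_descent`, the bisection used to locate an admissible step, and the default value sigma = 0.25 are all illustrative choices. The bisection relies on the facts that $g(x,\rho)$ tends to 1 as $\rho$ tends to 0 and that $g(x,1) < \sigma$ in the branch where it is used.

```python
def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def goldstein_descent(f, grad, x0, sigma=0.25, tol=1e-8, max_iter=500):
    """Iterate x_{k+1} = x_k - rho_k * phi(x_k) with phi = grad (Hilbert-space case)."""
    x = list(x0)
    for _ in range(max_iter):
        g = grad(x)
        slope = dot(g, g)  # [f'(x), phi(x)] when phi(x) = f'(x)
        if slope <= tol * tol:
            break  # x_{k+1} = x_k once the pairing (numerically) vanishes

        def ratio(rho):
            # g(x, rho) = (f(x) - f(x - rho*phi(x))) / (rho * [f'(x), phi(x)])
            trial = [xi - rho * gi for xi, gi in zip(x, g)]
            return (f(x) - f(trial)) / (rho * slope)

        rho = 1.0
        if ratio(1.0) < sigma:
            # Look for rho in (0, 1) with sigma <= g(x, rho) <= 1 - sigma.
            lo, hi = 0.0, 1.0
            for _ in range(60):
                rho = 0.5 * (lo + hi)
                r = ratio(rho)
                if r < sigma:
                    hi = rho      # step too long
                elif r > 1.0 - sigma:
                    lo = rho      # step too short
                else:
                    break
        x = [xi - rho * gi for xi, gi in zip(x, g)]
    return x
```

For example, with f(x) = x_1^2 + 10 x_2^2 the full step rho = 1 always overshoots, so every iteration uses the damped branch, and the iterates still converge to the minimizer at the origin.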


The proof of (a) is given in [1]. The proof there is stated
for E, a Hilbert space, but the same proof works when E is
taken to be a n. l. space. Two comments might be made, however.
S bounded and f' uniformly continuous on S implies that f' is
bounded on S. (See e.g. [5], p. 19.) It follows by employing the
mean value theorem that f is bounded below on S. The statements
"f is uniformly F-differentiable" and "the F-derivative f' is
uniformly continuous" are equivalent. (See [5], p. 45.)

(b) Given $\epsilon > 0$ choose $z' \in E$ such that $f(z') \le \theta + \epsilon/2$.
Because f' exists at $x_k$ and f is convex, $f(z') \ge f(x_k) + [f'(x_k), z' - x_k]$.
Since $\{f'(x_k)\} \to 0$ and S is bounded, for all k sufficiently large
$f(x_k) \le f(z') + \epsilon/2 \le \theta + \epsilon$, showing that $L = \theta$.

If E is reflexive and S is convex, closed, and bounded, then S
is weakly compact. Since f is convex, the sets $\{x \in E : f(x) \le k\}$ are
closed, convex, and weakly closed, for all k. Thus f is weakly lower
semi-continuous. If z is a weak cluster point of $\{x_k\}$ then for an
appropriate subsequence, $\liminf f(x_k) = L \ge f(z)$. Assume $E = L_p[0,1]$
and f is the norm on E. By [6], p. 78, if $\{x_k\}$ converges weakly to
z and $f(x_k) \to f(z)$ then $\{x_k\}$ converges strongly to z. It follows that
every weak cluster point of $\{x_k\}$ is a strong cluster point of $\{x_k\}$.
Since f' vanishes at every weak cluster point of $\{x_k\}$ and f' vanishes
only once by the strict convexity of f, every subsequence of $\{x_k\}$ has
the same weak cluster point z, showing that $\{x_k\}$ converges to z.
(c) The hypotheses of (c) imply that f' is Lipschitz continuous and that
the set S is bounded. Otherwise S would contain an unbounded sequence, say $\{z_k\}$.
By Taylor's theorem if $u \in S$, $f(z_k) \ge f(u) + \|z_k - u\|[\|z_k - u\|\mu/2 - \|f'(u)\|]$,
showing that $f(z_k) > f(x_0)$, for large k, whence S must be bounded.

We now show that the sequence $\{x_k\}$ is Cauchy. Again by Taylor's theorem
if s > k, $f(x_s) - f(x_k) \ge [f'(x_k), x_s - x_k] + \mu\|x_s - x_k\|^2/2$. Since S
is bounded, $\|x_s - x_k\| \le D$ where D is the diameter of S. Thus
$\|x_s - x_k\|^2 \le (2/\mu)\{f(x_s) - f(x_k) + D\|f'(x_k)\|\}$ which shows that $\{x_s\}$ is
a Cauchy sequence. By the completeness of E $\{x_s\}$ has a limit, say z,
in E, and f'(z) = 0. If z is not unique, then $f'(z_1) = f'(z_2) = 0$, $z_1 \ne z_2$.
Thus $f(z_1) - f(z_2) \ge \mu\|z_1 - z_2\|^2/2 > 0$ and, symmetrically,
$f(z_2) - f(z_1) > 0$, a contradiction.

Hence z is unique and is a minimizer of f.

Remarks:  Useful  remarks  may  be  found  in  [1],  [3]  and  [9]. 

II.  NEWTONIAN  STEPS  AND  ACCELERATION 

Suppose that at the given point $x_0$, the function f satisfies the
conditions of the first example, namely $\phi(x) = f''_{-1}(x_0)f'(x)$, where
$f''_{-1}(x_0) = [f''(x_0)]^{-1}$. The corresponding iteration is
$x_{n+1} = x_n - \rho_n f''_{-1}(x_0)f'(x_n)$.
This algorithm, when $\rho_n = 1$, is known as the "modified" Newton's method (see
[3], p. 259, or [7], p. 696). In a similar manner if $f''_{-1}(x)$ exists and is
uniformly bounded below on S, we may define $\phi(x_n) = f''_{-1}(x_n)f'(x_n)$. We
shall do this below. It is clear from what has already been said that $\phi$
satisfies hypotheses of the above theorem. Our object now is to formulate
an algorithm using $f''_{-1}(x_n)f'(x_n) = \phi(x_n)$ which will converge at a
superlinear rate.

In the following we set $\Delta(x,\rho) = f(x) - f(x - \rho f''_{-1}(x)f'(x))$ and
$g(x,\rho) = \Delta(x,\rho)/\rho[f'(x), f''_{-1}(x)f'(x)]$.

Theorem. Assume the level set S is a convex subset of a Banach space
E. For each x in S assume the F-derivative f" is continuous on S,
f"(x) is onto, $\|f''(x)\| \le M$, and $[f''(x)z,z] \ge m\|z\|^2$ for some m > 0
and all z in E. Set $x_{k+1} = x_k - \rho_k f''_{-1}(x_k)f'(x_k)$, where $\rho_k$ is chosen
so that for $\delta < 1/2$, $0 < \delta \le g(x_k,\rho_k) \le 1 - \delta$ with $\rho_k = 1$ if possible. Then:

(a) There exists a number N such that if k > N then $\rho_k = 1$.

(b) There is a unique minimizer of f and the sequence $\{x_k\}$
converges to it faster than any geometric progression.

Proof. We have for all x in S that $M\|z\|^2 \ge [f''(x)z,z] \ge m\|z\|^2$
and $m^{-1}\|y\|^2 \ge [y, f''_{-1}(x)y] \ge mM^{-2}\|y\|^2$. Thus if $\phi(x) = f''_{-1}(x)f'(x)$,
then $[f'(x),\phi(x)] \ge mM^{-2}\|f'(x)\|^2$, showing that $\phi$ satisfies the
conditions of the above theorem. Since f" is bounded on S, f' is
Lipschitz continuous, by the mean value theorem. By (c) above $\{x_k\}$
converges to a unique minimizer of f.

Expand $\Delta(x,\rho)$ to two terms in the Taylor series with remainder
$[f''(\xi)h,h]/2$, where $h = \rho f''_{-1}(x)f'(x)$. Set $f''(\xi) = f''(x) + f''(\xi) - f''(x)$.
Then
$g(x,\rho) = 1 - \rho/2 - \rho[(f''(\xi) - f''(x))f''_{-1}(x)f'(x), f''_{-1}(x)f'(x)]/2[f'(x), f''_{-1}(x)f'(x)]$
$\ge 1 - \rho/2 - \rho\|f''(\xi) - f''(x)\|M^2/2m^3$.
Thus $|g(x,\rho) - 1 + \rho/2| \le \rho\|f''(\xi) - f''(x)\|M^2/2m^3$. Since $\xi(\rho_k)$ lies
between $x_k$ and $x_{k+1}$, the sequence $x_0, \xi_0, x_1, \xi_1, \ldots$
is a Cauchy sequence; and it, together with its limit z, is a compactum.
Consequently, on this compactum f" is uniformly continuous, so that
$\{\|f''(\xi(\rho_k)) - f''(x_k)\|\}$ converges to 0, showing that the choice $\rho_k = 1$
is eventually feasible.

To prove (b) we write

$x_{k+1} - z = x_k - z - \rho_k f''_{-1}(x_k)f'(x_k)$
$= x_k - z - \rho_k f''_{-1}(x_k)f''(x_k)(x_k - z) + \rho_k f''_{-1}(x_k)[f''(x_k)(x_k - z) - f'(x_k)]$.

Thus $\|x_{k+1} - z\| \le \|x_k - z - \rho_k(x_k - z)\| + \rho_k\|f''_{-1}(x_k)\|\,\|f''(x_k)(x_k - z) - f'(x_k)\|$.

Since f' is F-differentiable at $x_k$ and f'(z) = 0,
$\|f'(z) - f'(x_k) - f''(x_k)(z - x_k)\| < \epsilon\|z - x_k\|$.
Thus $\|x_{k+1} - z\| \le (1 - \rho_k)\|x_k - z\| + \rho_k m^{-1}\epsilon\|z - x_k\|$.
Q.E.D.
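Conclusion (a) of the theorem can be observed numerically on a one-dimensional example. The sketch below is an illustration only, under assumptions that are not in the paper: the test function f(x) = sqrt(1 + x^2) - 1 (written in a cancellation-free form), the value delta = 0.25, and the bisection used to find an admissible step are all hypothetical choices. One simplification is flagged in the code: the full step is kept whenever g(x,1) is at least delta, so the upper bound 1 - delta is enforced only in the damped branch.

```python
import math

def f(x):
    # sqrt(1 + x^2) - 1, rewritten to avoid cancellation near x = 0
    return x * x / (1.0 + math.sqrt(1.0 + x * x))

def df(x):
    return x / math.sqrt(1.0 + x * x)

def d2f(x):
    return (1.0 + x * x) ** -1.5

def damped_newton(x0, delta=0.25, tol=1e-10, max_iter=50):
    """Newton iteration x <- x - rho * f'(x)/f''(x) with the step test on g(x, rho)."""
    x, steps = x0, []
    for _ in range(max_iter):
        g = df(x)
        if abs(g) < tol:
            break
        direction = g / d2f(x)      # f''(x)^{-1} f'(x)
        slope = g * direction       # [f'(x), f''(x)^{-1} f'(x)]

        def ratio(rho):
            return (f(x) - f(x - rho * direction)) / (rho * slope)

        rho = 1.0                   # rho = 1 "if possible" (simplified: only g >= delta is checked)
        if ratio(1.0) < delta:
            lo, hi = 0.0, 1.0
            for _ in range(60):
                rho = 0.5 * (lo + hi)
                r = ratio(rho)
                if r < delta:
                    hi = rho
                elif r > 1.0 - delta:
                    lo = rho
                else:
                    break
        x -= rho * direction
        steps.append(rho)
    return x, steps
```

Starting from x0 = 2.0 the undamped Newton step would move away from the minimizer, so the first step is damped; once the iterates are close to 0 the ratio g(x,1) approaches 1/2 and the full step is accepted from then on.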

Remarks:

(1) Both sides of the inverse of f"(x) are used in the proof.

(2) The analogue of the modified Newton process, namely choosing
$\phi(x) = f''_{-1}(x_0)f'(x)$ or $f''_{-1}(x_k)f'(x)$ with k fixed, also will under the hypothesis
of the above theorem generate a sequence converging to a unique minimizer
of f. Since $\|x_{k+1} - z\| \le \|x_k - z - \rho_k f''_{-1}(x_0)f''(z)(x_k - z)\| + \rho_k\|f''_{-1}(x_0)\|\,\epsilon\|x_k - z\|$
when $\|x_k - z\| < \delta$, the rate of convergence is eventually
geometric provided $\|I - \rho_k f''_{-1}(x_0)f''(z)\| < 1$. Since
$\|I - \rho_k f''_{-1}(x_0)f''(z)\| \le 1 - \rho_k + \rho_k\|f''_{-1}(x_0)\|\,\|f''(x_0) - f''(z)\|$, if
$\|f''(x_0) - f''(z)\|$ is sufficiently small, $\rho_k = 1$ will generate a
sequence converging to z at the rate of geometric progression. A sufficient
condition for the global geometric convergence would be (M/m) < 1/2, since
$\|f''(x)\| \le M$ and $\|f''_{-1}(x)\| \le m^{-1}$.

(3) Pertinent remarks may be found in [3].
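The geometric rate described in remark (2) can be illustrated on a scalar example. This is a sketch under assumptions not found in the paper: the function f(x) = sqrt(1 + x^2) - 1 with minimizer z = 0, the starting point, and the helper names are hypothetical. With $\rho_k = 1$ and the second derivative frozen at $x_0$, the ratio of successive errors should approach $|1 - f''(z)/f''(x_0)|$, the scalar form of $\|I - f''_{-1}(x_0)f''(z)\|$.

```python
import math

def df(x):
    # derivative of f(x) = sqrt(1 + x^2) - 1
    return x / math.sqrt(1.0 + x * x)

def d2f(x):
    return (1.0 + x * x) ** -1.5

def modified_newton(x0, iterations=15):
    """x_{n+1} = x_n - f''(x0)^{-1} f'(x_n): the second derivative is frozen at x0."""
    inv_h0 = 1.0 / d2f(x0)
    xs = [x0]
    for _ in range(iterations):
        xs.append(xs[-1] - inv_h0 * df(xs[-1]))
    return xs

def error_ratios(xs, z=0.0):
    """Ratios |x_{n+1} - z| / |x_n - z| of successive errors."""
    return [abs(b - z) / abs(a - z) for a, b in zip(xs, xs[1:]) if a != z]
```

For x0 = 0.5 the predicted ratio is about 0.4, so the error shrinks by a fixed factor per step rather than superlinearly, in contrast to the iteration of the theorem.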
III. EXAMPLE


(a) We consider the following problem which arises from a
linearized rendezvous problem. See for example [8], [9] and [4].
In [4] this problem is solved in the "dual". We consider here a
construction in the "primal". In [8] and [9], we have discussed
this problem in other function spaces; we now discuss the
problem in $\mathcal{L}_p$ for p > 2. Let $\mathcal{L}_p$ denote the direct sum of n
$L_p[0,1]$ spaces. Thus a point $x \in \mathcal{L}_p$ if $x = (x_1,\ldots,x_n)$ and
$x_i \in L_p[0,1]$; the norm in $\mathcal{L}_p$ will be

$\|x\| = \left[\int_0^1 |x(t)|^p\,dt\right]^{1/p}$, where $|x(t)| = \left[\sum_{i=1}^n x_i^2(t)\right]^{1/2}$.

Since $\sqrt{n}\,\max\{|x_i(t)| : 1 \le i \le n\} \ge |x(t)|$,
$\|x\|$ is well defined. Let $\{u^i : 1 \le i \le m\}$ be a linearly independent
set in $\mathcal{L}_p$. Set $1/p + 1/q = 1$. Since q < p, $u^i$ is also in $\mathcal{L}_q$. Given
numbers $\alpha_i$, $(1 \le i \le m)$, define the affine subspace
$M = \{x \in \mathcal{L}_p : [u^i,x] = \alpha_i : 1 \le i \le m\}$. We shall consider the problem
of minimizing $f(x) = \|x\|^p$ on M. The limits $p \to 1$ and $p \to \infty$
correspond to the cases of rendezvous with minimum fuel and minimum
thrust amplitude respectively. In what follows we shall assume for
simplicity that n = 2. There are no further difficulties in the general
case.

We first observe that if the Gateaux differential (G-differential)
of f exists it is given by:

$f'(x)h = p\int_0^1 |x(t)|^{p-2}[x_1(t)h_1(t) + x_2(t)h_2(t)]\,dt$
$= p\int_0^1 |x(t)|^{p-1}\left[\frac{x_1(t)}{|x(t)|}h_1(t) + \frac{x_2(t)}{|x(t)|}h_2(t)\right]dt$
$\le p\|x\|^{p/q}(\|h_1\|_p + \|h_2\|_p)$.

Here Holder's inequality has been employed on the function $t \to |x(t)|^{p-1}$
which belongs to $L_q[0,1]$. We have also used $\|\cdot\|_p$ for the norm in $L_p[0,1]$.
Thus the G-derivative of f exists.
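The formula for the G-differential can be checked numerically on a discretization. The sketch below is an assumption-laden illustration, not part of the paper: the integrals over [0,1] are replaced by a midpoint rule, n = 2 as in the text, and the function names and the finite-difference step are hypothetical. It compares $p\int_0^1 |x(t)|^{p-2}[x_1h_1 + x_2h_2]\,dt$ with a central difference quotient of $f(x) = \int_0^1 |x(t)|^p\,dt$ in the direction h.

```python
import math

def f_value(x1, x2, p):
    """Midpoint-rule approximation of f(x) = int_0^1 |x(t)|^p dt, with n = 2."""
    step = 1.0 / len(x1)
    return step * sum((a * a + b * b) ** (p / 2.0) for a, b in zip(x1, x2))

def f_differential(x1, x2, h1, h2, p):
    """Approximation of f'(x)h = p * int_0^1 |x|^(p-2) (x1*h1 + x2*h2) dt."""
    step = 1.0 / len(x1)
    total = 0.0
    for a, b, c, d in zip(x1, x2, h1, h2):
        r = math.sqrt(a * a + b * b)      # |x(t)| at one sample point
        total += r ** (p - 2.0) * (a * c + b * d)
    return p * step * total

def central_difference(x1, x2, h1, h2, p, eps=1e-6):
    """(f(x + eps*h) - f(x - eps*h)) / (2*eps) on the same grid."""
    plus = f_value([a + eps * c for a, c in zip(x1, h1)],
                   [b + eps * d for b, d in zip(x2, h2)], p)
    minus = f_value([a - eps * c for a, c in zip(x1, h1)],
                    [b - eps * d for b, d in zip(x2, h2)], p)
    return (plus - minus) / (2.0 * eps)
```

On smooth sample data the two values agree to roughly the accuracy of the difference quotient, which is consistent with the displayed expression for f'(x)h.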

Observe now that if the second G-differential exists it is given by:

$[f''(x)h,k] = p(p-2)\int_0^1 |x(t)|^{p-4}(x_1(t)h_1(t) + x_2(t)h_2(t))(x_1(t)k_1(t) + x_2(t)k_2(t))\,dt$
$\quad + p\int_0^1 |x(t)|^{p-2}(k_1(t)h_1(t) + k_2(t)h_2(t))\,dt$

$= p(p-2)\int_0^1 |x(t)|^{p-2}\left(\frac{x_1(t)}{|x(t)|}h_1(t) + \frac{x_2(t)}{|x(t)|}h_2(t)\right)\left(\frac{x_1(t)}{|x(t)|}k_1(t) + \frac{x_2(t)}{|x(t)|}k_2(t)\right)dt$
$\quad + p\int_0^1 |x(t)|^{p-2}(k_1(t)h_1(t) + k_2(t)h_2(t))\,dt$

$\le 2p(p-2)\int_0^1 |x(t)|^{p-2}|h(t)||k(t)|\,dt + p\int_0^1 |x(t)|^{p-2}|h(t)||k(t)|\,dt$.

If $x \in L_p$, $y \in L_q$ and $z \in L_r$, and $1/p + 1/q + 1/r = 1$, then
$\int_0^1 |x(t)y(t)z(t)|\,dt \le \|x\|_p\|y\|_q\|z\|_r$. Since the function
$t \to |x(t)|^{p-2}$ belongs to $L_{p'}$, where $p' = p/(p-2)$ and $1/p' + 2/p = 1$,
it follows that if $u(t) = |x(t)|^{p-2}$,
$\left[\int_0^1 |u(t)|^{p'}\,dt\right]^{1/p'} = \|x\|^{p-2}$, and
$[f''(x)h,k] \le (2p^2 - 3p)\|x\|^{p-2}\|h\|\,\|k\|$.

As before, let S denote the level set of f at $x_0$ where $x_0$ will be
subsequently chosen in M. Thus if $x \in S$,
$\|x\|^{p-2} = [f(x)]^{(p-2)/p} \le [f(x_0)]^{(p-2)/p}$,
showing that [f"(x)h,k] is uniformly bounded on S, if h and k are
confined to the unit sphere. It follows by Taylor's theorem that f' is
F-differentiable on S. By the generalized mean value theorem it
further follows that f' is Lipschitz continuous on S.

We now construct $x_0$ on M. Let $x_0 = \sum_j c_j u^j$. Thus $x_0$ lies on
M if and only if $\sum_j c_j[u^i,u^j] = \alpha_i$. We show that the null space of the
matrix $\{[u^i,u^j]\}$ consists only of the 0 element so that $c_j$ is
uniquely determined. If for some $c \ne 0$, $\sum_j c_j[u^i,u^j] = 0$ then
$\sum_i c_i \sum_j c_j[u^i,u^j] = [\sum_i c_i u^i, \sum_j c_j u^j] = 0$, contradicting the linear independence
of the set $\{u^i : 1 \le i \le m\}$. Let $N = \{x \in \mathcal{L}_p : [u^i,x] = 0,\ i = 1,\ldots,m\}$.
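The construction of $x_0$ amounts to solving the linear system with the matrix $\{[u^i,u^j]\}$. A minimal sketch on a discretization follows; it is an illustration under assumptions not in the paper: integrals are replaced by a midpoint rule, each element of the space is stored as a pair of sample lists (n = 2), and the names `inner`, `solve_linear` and `construct_x0` are hypothetical.

```python
def inner(u, x):
    """Midpoint-rule value of [u, x] = int_0^1 (u_1 x_1 + u_2 x_2) dt."""
    step = 1.0 / len(u[0])
    return step * sum(a * c + b * d
                      for a, b, c, d in zip(u[0], u[1], x[0], x[1]))

def solve_linear(a, b):
    """Gaussian elimination with partial pivoting for a small dense system."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for k in range(col, n + 1):
                m[r][k] -= factor * m[col][k]
    c = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(m[r][k] * c[k] for k in range(r + 1, n))
        c[r] = (m[r][n] - tail) / m[r][r]
    return c

def construct_x0(us, alphas):
    """Return x0 = sum_j c_j u^j with [u^i, x0] = alpha_i for every i."""
    gram = [[inner(ui, uj) for uj in us] for ui in us]
    c = solve_linear(gram, alphas)
    size = len(us[0][0])
    x0 = ([0.0] * size, [0.0] * size)
    for cj, uj in zip(c, us):
        for comp in range(2):
            for t in range(size):
                x0[comp][t] += cj * uj[comp][t]
    return x0
```

Linear independence of the $u^i$ is what makes the pivots nonzero here; with dependent functions the elimination would divide by (numerically) zero.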


We now choose h to maximize [f'(x),h] subject to $\|h\| = 1$ and
$h \in N$. The maximum is achieved because the sphere meets N in a
weakly compact set and the linear function $[f'(x),\cdot]$ is weakly continuous.

The maximization can be accomplished by the method of Euler multipliers
[10]. Let $G(h) = \|h\|^p - 1$ and $G_i(h) = [u^i,h]$. Then a necessary
condition that h maximize f'(x,h) subject to $G(h) = G_i(h) = 0$ is
that there exist $c_i$, $(1 \le i \le m+1)$, such that

$f'(x)k = c_1 p\int_0^1 |h(t)|^{p-2}(h_1(t)k_1(t) + h_2(t)k_2(t))\,dt + \sum_{j=2}^{m+1} c_j[u^{j-1},k]$ for all $k \in \mathcal{L}_p$.

It follows that

$p|x(t)|^{p-2}x_i(t) = pc_1|h(t)|^{p-2}h_i(t) + c_2u^1_i(t) + \ldots + c_{m+1}u^m_i(t)$, i = 1,2.

Let $f_i(t) = (pc_1)^{-1}[p|x(t)|^{p-2}x_i(t) - c_2u^1_i(t) - \ldots - c_{m+1}u^m_i(t)]$, and observe that

$|h(t)|^{2p-2} = f_1^2(t) + f_2^2(t)$.

Therefore:

$h_i(t) = f_i(t)/(f_1^2(t) + f_2^2(t))^{(p-2)/(2p-2)}$,

showing that $h_i \in L_p$. We now solve the non-linear equations: G(h) = 0,
$G_i(h) = 0$ $(1 \le i \le m)$ for $c_1,\ldots,c_{m+1}$. If necessary, we replace h
by -h to ensure that we have a maximizer rather than a minimizer. The
solution is now unique, due to the strict convexity of the sphere in $\mathcal{L}_p$.
Because of the uniqueness of the extremal, the h we have constructed
must be this extremal.


The subspace N is also an $\mathcal{L}_p$ space. Minimizing f(x) on M
is equivalent to minimizing $f(y + x_0)$ on N, with $x = y + x_0$. Clearly
the gradient of the function f restricted to N is h[f'(x),h]. (See
example (2) of I, above.) It follows, therefore, if $\phi(x) = h[f'(x),h]$, then $\phi$
satisfies the conditions required for the theorem of I.


By the theorem of I we may infer that every weak cluster point z of
the sequence $\{x^k\}$ minimizes f and $f(x^k)$ converges downward to f(z).
Furthermore, since z is unique in the above problem, $\{x^k\} \to z$ weakly. We
now show, moreover, that $\{x^k\} \to z$ in norm. For each component $x_i$ of x,
$(1 \le i \le n)$, we have that $0 \le \int_0^1 |x_i(t)|^p\,dt \le \int_0^1 |x(t)|^p\,dt$. Since
$\{f(x^k)\}$ converges, the numbers $v^i_k = \int_0^1 |x^k_i(t)|^p\,dt$ are bounded and the
sequence $\{x^k_i\}$ has a weak cluster point. In fact, $\{v^i_k\}$ converges. To
prove this, observe that since $\{x^k\} \to z$ weakly, $\{x^k_i\} \to z_i$ weakly. Take a subsequence
$\{x^k_i\}$ such that $\{v^i_k\} \to v^i$. Since $\{x^k_i\}$ converges both weakly and in $L_p$
norm, $\{x^k_i\}$ converges strongly. Thus $\{x^k_i\} \to z'_i$, say. By continuity
$f(x^k) \to f(z') = f(z)$. By uniqueness, z = z'. Therefore, the subsequence
$\{x^k_i\} \to z_i$. Suppose that $\{v^i_k\}$ had another cluster point $y^i \ne v^i$.
Take a new subsequence such that $\{v^i_k\} \to y^i$. Again $\{x^k_i\} \to z_i$,
and therefore, $y^i = \int_0^1 |z_i(t)|^p\,dt = v^i$, a contradiction. It follows,
therefore, that $\{x^k\} \to z$.

k 

(b) The above processes require that at each cycle a non-linear
system be solved to determine the gradient. This can be circumvented by
imbedding the problem into a Hilbert space. Specifically, assume that the
components of $u^i$ are bounded and measurable. Let $\mathcal{L}_2$ denote the direct
sum of $L_2[0,1]$ analogously to the above, and define
$M' = \{x \in \mathcal{L}_2 : [u^i,x] = \alpha_i : 1 \le i \le m\}$. Let f now be defined on M'.
Since f achieves a minimum on M and $M \subset M'$, f also achieves a
minimum on M'. Because M is dense in M', the minima are equal. The
gradient of f on M' is merely the restriction of the gradient of f in
$\mathcal{L}_2$ to M' and is obtained by orthogonal projection. See [9]. In
general, f'(x) does not exist. But if x is bounded and measurable,
i.e., $x \in \mathcal{L}_\infty$, then $f'(x) \in \mathcal{L}_\infty$ and $f(x) < \infty$. The set S is bounded in
$\mathcal{L}_p$ and this implies S is bounded in $\mathcal{L}_2$, since $\|x\|_2 \le \|x\|_p$ if
$p \ge 2$.

Since f is convex and continuous, S is closed, bounded and convex;
furthermore, the derivatives of f are densely defined on S. Assume
$x_n \in M'$, $x_n \in \mathcal{L}_\infty$ and $u^i \in \mathcal{L}_\infty$, $(1 \le i \le m)$. Then $x_{n+1}$ is well defined
and is also in $\mathcal{L}_\infty$. To see this, verify that $\nabla f(x_n) \in \mathcal{L}_\infty$ and the
projection of $\nabla f(x_n)$ on the set $\{x : [u^i,x] = 0,\ (1 \le i \le m)\}$ is
also in $\mathcal{L}_\infty$. We are able to conclude again therefore that there is a
unique minimizer z, and that $\{x_n\} \to z$ and $f(x_n) \downarrow f(z)$.
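The Hilbert-space variant of (b) can be sketched on a discretization: the gradient of f in the $L_2$ pairing is projected orthogonally onto the null space of the constraints, and a step test of the kind used in Section I selects the step length. This is a minimal illustration under assumptions not in the paper: a single constraint (m = 1) for brevity, a midpoint rule in place of the integrals, step halving in place of an exact step rule, and hypothetical function names and tolerances.

```python
import math

def inner(u, x):
    """Midpoint-rule value of [u, x] = int_0^1 (u_1 x_1 + u_2 x_2) dt."""
    step = 1.0 / len(u[0])
    return step * sum(a * c + b * d
                      for a, b, c, d in zip(u[0], u[1], x[0], x[1]))

def f_value(x, p):
    step = 1.0 / len(x[0])
    return step * sum(math.hypot(a, b) ** p for a, b in zip(x[0], x[1]))

def gradient(x, p):
    """L2 gradient of f: the function t -> p |x(t)|^(p-2) x_i(t)."""
    g1, g2 = [], []
    for a, b in zip(x[0], x[1]):
        w = p * math.hypot(a, b) ** (p - 2.0)
        g1.append(w * a)
        g2.append(w * b)
    return (g1, g2)

def project(v, u):
    """Orthogonal projection of v onto {x : [u, x] = 0} (single constraint)."""
    c = inner(u, v) / inner(u, u)
    return ([a - c * s for a, s in zip(v[0], u[0])],
            [b - c * s for b, s in zip(v[1], u[1])])

def move(x, d, rho):
    return ([a - rho * s for a, s in zip(x[0], d[0])],
            [b - rho * s for b, s in zip(x[1], d[1])])

def projected_descent(u, alpha, p, sigma=0.25, iterations=200):
    c = alpha / inner(u, u)
    x = ([c * s for s in u[0]], [c * s for s in u[1]])   # x0 = c*u lies on M'
    for _ in range(iterations):
        d = project(gradient(x, p), u)
        slope = inner(d, d)        # equals [grad f(x), d] because d is a projection
        if slope < 1e-12:
            break
        rho, accepted = 1.0, False
        for _halving in range(60):
            trial = move(x, d, rho)
            if f_value(x, p) - f_value(trial, p) >= sigma * rho * slope:
                accepted = True
                break
            rho *= 0.5
        if not accepted:
            break
        x = trial
    return x
```

Because every direction d satisfies [u, d] = 0, each iterate stays on the affine set M' up to rounding, and no non-linear system has to be solved to obtain the search direction.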